A Note On Spectral Clustering

نویسندگان

  • Pavel Kolev
  • Kurt Mehlhorn
چکیده

Let G = (V,E) be an undirected graph, λk the kth smallest eigenvalue of the normalized Laplacian matrix LG of G, and ρ(k) (ρ̂(k)) the smallest value of the maximal conductance over all k disjoint subsets Z1, . . . , Zk (that form a partition) of V . Oveis Gharan and Trevisan [3] proved the existence of a k-way partition (P1, . . . , Pk) of V with ρ̂(k) 6 kρ(k). The k-way (approximate) partitioning problem asks to partition a graph into k clusters such that the conductance of each cluster is (approximately) bounded by ρ̂(k). Peng et al. [4] gave the first rigorous analysis of approximation algorithms for the k-way partitioning problem that are based on clustering suitably normalized eigenvectors of LG with the help of an approximate k-means algorithm. Their analysis relies on the following gap assumption: Υ , λk+1 ρ̂(k) > Ω(k). We strengthen the analysis in two directions. First, we improve the approximation guarantee by a factor of Θ(k) and second we require only a weaker gap assumption: Ψ , λk+1 ρ̂avr(k) > Ω(k), (1) where ρ̂avr(k) is the minimal average conductance over all k-way partitions achieving ρ̂(k). Furthermore, for graphs G that satisfy the gap assumption (1) with k = w(1), our improved analysis gives an algorithm running in time O(nk) that on input a suitable spectral embedding of V outputs with constant probability a k-way partition of V with identical approximation guarantees as in [4]. This speeds up the algorithm in [4] by a O(2)-factor. This work has been funded by the Cluster of Excellence “Multimodal Computing and Interaction” within the Excellence Initiative of the German Federal Government.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A note on spectral mapping theorem

This paper aims to present the well-known spectral mapping theorem for multi-variable functions.

متن کامل

Assessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories

In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...

متن کامل

Oil Reservoirs Classification Using Fuzzy Clustering (RESEARCH NOTE)

Enhanced Oil Recovery (EOR) is a well-known method to increase oil production from oil reservoirs. Applying EOR to a new reservoir is a costly and time consuming process. Incorporating available knowledge of oil reservoirs in the EOR process eliminates these costs and saves operational time and work. This work presents a universal method to apply EOR to reservoirs based on the available data by...

متن کامل

A note on the bounds of Laplacian-energy-like-invariant

The Laplacian-energy-like of a simple connected graph G is defined as LEL:=LEL(G)=∑_(i=1)^n√(μ_i ), Where μ_1 (G)≥μ_2 (G)≥⋯≥μ_n (G)=0 are the Laplacian eigenvalues of the graph G. Some upper and lower bounds for LEL are presented in this note. Moreover, throughout this work, some results related to lower bound of spectral radius of graph are obtained using the term of ΔG as the num...

متن کامل

A Note on Spectrum Preserving Additive Maps on C*-Algebras

Mathieu and Ruddy proved that if be a unital spectral isometry from a unital C*-algebra Aonto a unital type I C*-algebra B whose primitive ideal space is Hausdorff and totallydisconnected, then is Jordan isomorphism. The aim of this note is to show that if be asurjective spectrum preserving additive map, then is a Jordan isomorphism without the extraassumption totally disconnected.

متن کامل

A Clustering Based Location-allocation Problem Considering Transportation Costs and Statistical Properties (RESEARCH NOTE)

Cluster analysis is a useful technique in multivariate statistical analysis. Different types of hierarchical cluster analysis and K-means have been used for data analysis in previous studies. However, the K-means algorithm can be improved using some metaheuristics algorithms. In this study, we propose simulated annealing based algorithm for K-means in the clustering analysis which we refer it a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016